Efficient XML Path Filtering Using GPUs
Abstract
Publish-subscribe (pub-sub) systems represent the state of the art in information dissemination to multiple users. Current XML-based pub-sub systems provide users with considerable flexibility, allowing the formulation of complex queries on the content as well as the structure of the streaming messages. Messages that contain one or more matches for a given user profile (query) are forwarded to the user. Recently, various approaches have focused on accelerating XML path query filtering using dedicated hardware architectures, such as FPGAs. Despite their very high throughput, FPGAs require extensive update time, and their physical resource availability is limited. In this paper we exploit the parallelism found in XPath filtering systems using GPUs instead, which are favorable platforms due to the massive parallelism found in their hardware architecture, alongside the flexibility and programmability of software. By utilizing properties of the GPU memory hierarchy we can match thousands of user profiles at high throughput, requiring minimal update time. Efficient common prefix optimizations are also applied to the query set. An extensive experimental evaluation shows an average speedup of 10x (up to 2.5 orders of magnitude) versus state-of-the-art software approaches.
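The abstract mentions common prefix optimizations over the query set without giving details. As a rough illustration only, the sketch below shows one way shared prefixes of path profiles could be merged into a trie so that a common step is stored and evaluated once. It is a CPU-side Python sketch, not the paper's GPU implementation; the names `TrieNode`, `build_prefix_trie`, `count_nodes`, and `match_path` are hypothetical, and it assumes absolute profiles that use only the child axis (`/`) and the `*` wildcard.

```python
from typing import Dict, List, Set


class TrieNode:
    """One location step shared by every profile that has this prefix."""

    def __init__(self) -> None:
        self.children: Dict[str, "TrieNode"] = {}
        self.query_ids: Set[int] = set()  # profiles that end at this step


def build_prefix_trie(queries: List[str]) -> TrieNode:
    """Merge profiles such as '/a/b/c' so that common prefixes share nodes."""
    root = TrieNode()
    for qid, query in enumerate(queries):
        node = root
        for step in query.strip("/").split("/"):
            node = node.children.setdefault(step, TrieNode())
        node.query_ids.add(qid)
    return root


def count_nodes(node: TrieNode) -> int:
    """Count trie nodes, including the root, to show the effect of sharing."""
    return 1 + sum(count_nodes(child) for child in node.children.values())


def match_path(root: TrieNode, path: List[str]) -> Set[int]:
    """Return ids of profiles matched along one root-to-element tag path."""
    matched: Set[int] = set()
    frontier = [root]
    for tag in path:
        next_frontier = []
        for node in frontier:
            # A step matches on the exact tag name or on the '*' wildcard.
            for label in (tag, "*"):
                child = node.children.get(label)
                if child is not None:
                    next_frontier.append(child)
                    matched |= child.query_ids
        frontier = next_frontier
        if not frontier:
            break
    return matched


if __name__ == "__main__":
    profiles = ["/a/b/c", "/a/b/d", "/a/*/c", "/x/y"]
    trie = build_prefix_trie(profiles)
    print(count_nodes(trie))
    print(sorted(match_path(trie, ["a", "b", "c"])))
```

In this sketch the first three profiles share the single `a` node, and the first two also share `b`, so each incoming tag is compared against a shared step once rather than once per profile. Descendant axes (`//`) and predicates are deliberately left out.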
Similar resources
High-Performance Holistic XML Twig Filtering Using GPUs
The current state of the art in information dissemination comprises publishers broadcasting XML-coded documents, in turn selectively forwarded to interested subscribers. The deployment of XML at the heart of this setup greatly increases the expressive power of the profiles listed by subscribers, using the XPath language. On the other hand, with great expressive power comes great performance resp...
Time/Space Efficient Filtering of Streaming XML Documents Using Incrementally Constructed Path-trie
In this paper, we present a streaming XML document filter named DXAXEN which is based on incremental construction of path-trie. It runs very fast, and processes a large number of XPath queries efficiently. Experimental comparison with XMLTK, a well-known streaming XML document filter, shows that DXAXEN is 2–5 times faster and needs only 5–20 percent of memory.
Efficient Filtering of XML Documents with XPath Expressions
We propose a novel index structure, termed XTrie, that supports the efficient filtering of XML documents based on XPath expressions. Our XTrie index structure offers several novel features that make it especially attractive for large-scale publish/subscribe systems. First, XTrie is designed to support effective filtering based on complex XPath expressions (as opposed to simple, single-path speci...
High-Performance XML Filtering: An Overview of YFilter
We have developed YFilter, an XML filtering system that provides fast, on-the-fly matching of XML-encoded data to large numbers of query specifications containing constraints on both structure and content. YFilter encodes path expressions using a novel NFA-based approach that enables highly efficient, shared processing for large numbers of XPath expressions. In this paper, we provide a brief tec...
High-Performance XML Twig Filtering using GPUs
The current state of the art in information dissemination comprises publishers broadcasting XML-coded documents, in turn selectively forwarded to interested subscribers. The deployment of XML at the heart of this setup greatly increases the expressive power of the profiles listed by subscribers, using the XPath language. On the other hand, with great expressive power comes great performance resp...